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BACKGROUND OF THE INVENTION 

[0002] This invention relates generally to data storage systems, and more 
particularly to data storage systems having redundancy arrangements to protect against total 
system failure in the event of a failure in a component or subassembly of the storage system. 

[0003] As is known in the art, large host computers and servers (collectively 
referred to herein as "host computer/servers") require large capacity data storage systems. 
These large computer/servers generally includes data processors, which perform many 
operations on data introduced to the host computer/server through peripherals including the data 
storage system. The results of these operations are output to peripherals, including the storage 
system. 

[0004] One type of data storage system is a magnetic disk storage system. Here, as 
shown in FIG. 1, a bank of disk drives and the host computer/server are coupled together 
through a system interface. The interface includes "front end" or host computer/server 
controllers (or directors) and "back-end" or disk controllers (or directors). The interface 



operates the controllers (or directors) in such a way that they are transparent to the host 
computer/server. That is, data is stored in, and retrieved from, the bank of disk drives in such a 
way that the host computer/server merely thinks it is operating with its own local disk drive. 
One such system is described in U.S. Patent 5,206,939, entitled "System and Method for Disk 

5 Mapping and Data Retrieval 1 ', inventors Moshe Yanai, Natan Vishlitzky, Bruno Alterescu and 
Daniel Castel, issued April 27, 1993, and assigned to the same assignee as the present invention. 

[0005] As described in such U.S. Patent, the interface may also include, in addition 
to the host computer/server controllers (or directors) and disk controllers (or directors), 
addressable cache memories. The cache memory is a semiconductor memory and is provided to 

10 rapidly store data from the host computer/server before storage in the disk drives, and, on the 
other hand, store data from the disk drives prior to being sent to the host computer/server. The 
cache memory being a semiconductor memory, as distinguished from a magnetic memory as in 
the case of the disk drives, is much faster than the disk drives in reading and writing data. 
[0006] As is also known in the art it is desirable to provide an interface with 

15 maximum data bandwidth, minimum system latency and minimum cost. 

SUMMARY OF THE INVENTION 

[0007] In accordance with the present invention, a system interface is provided 
having a packet switching network, a cache memory, and a plurality of directors, one portion 
of such directors being adapted for coupling to a host computer/server and another portion of 
20 the directors being adapted for coupling to a bank of disk drives. The plurality of directors 
and cache memory are interconnected through the packet switching network. 

[0008] In one embodiment, each one of the directors is coupled to a crossbar switch. 
The cross bar switch is directly connected to at least two other ones of the cross bar switches 
networks and indirectly connected to of other ones of the crossbar switches through the at 
25 least two directly connected crossbar switches. 

[0009] In such embodiment, the packet switching network comprises the crossbar 
switches of the directly and indirectly connected crossbar switches. 

[0010] In one embodiment, the system interface includes a plurality of the packet 
switching networks. Each one of the packet switching networks includes: a plurality of cross 
30 bar switches. Each cross bar switch is connected: (1) directly to crossbar switches of at least 
two other ones of crossbar switches in such one of the packet switching networks and to one 



of the cross bar switches of a second one of the plurality of packet switching networks; and, 
(2) indirectly to other ones of the crossbar switches of such one of the packet switching 
networks via the crossbar switch directly connected to such one of the packet switching 
network. Two of said at least two other ones of the crossbar switches in such one of the 
packet switching networks are connected indirectly to two cross bar switches of the second 
one of the packet switching networks. The two cross bar switches of the second one of the 
packet switching networks are cross bar switches connected indirectly to said one of the cross 
bar switches of the second one of the plurality of packet switching networks. 

[001 1] In one embodiment, the cache memory includes a plurality of cache memory 
sections, each one of such sections being coupled to a corresponding one of the crossbar 
switches. 

[0012] In one embodiment, the system interface includes a common resource section 
for sharing a resource among the plurality of directors. 

[0013] In one embodiment, the common shared resource section includes a shared 
computer code used by the plurality of directors. The code includes computer code for 
booting up each one of the plurality directors. The common shared code storage section is 
interconnected to the directors through the packet switching network. 

[0014] In one embodiment, the system interface includes a second, redundant 
common shared code storage section having stored computer code used by the plurality of 
directors in the event the first mentioned one of the common shared code storage section 
fails. 

[0015] In one embodiment, the system interface includes a second, redundant packet 
switching network for use in interconnecting the directors in the event the first mentioned 
one of the shared code storage section fails. 

[0016] In one embodiment, the system interface includes a cache memory manager 
interconnected to the directors and the cache memory through the packet switching network. 
The cache memory manager provides an interface between the host computer, the bank of 
disk drives and the cache memory for determining for the directors whether data to be read 
from the disk drives, or data to be written to the disk drives, resides in the cache memory. 

[0017] In accordance with another feature of the invention, a queuing system is 
provided. The system includes at least one input/output (I/O) interface having an outbound 
queue and a plurality of processing units coupled to the at least one I/O interface, each one of 
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the processing units being coupled to a corresponding processing unit memory. Each one of 
the processing unit memories has an inbound queue for such coupled processing unit. The at 
least one I/O interface outbound queue stores outbound information being returned to the I/O 
interface after being processed by one of the processing units. 

[0018] In one embodiment, the I/O interface creates queue indices for storage in the 
inbound queues of the processor unit memories. 

[0019] In one embodiment, the queuing includes a plurality of interconnected 
directors. Each one of the directors includes: an input/output interface section for receiving 
information from a source thereof and for returning information to interface to such source; 
and, a microprocessor for processing information sent thereto from a remote one of the 
directors. Each one of the microprocessors has a CPU and a CPU memory, such CPU 
memory storing a queue for inbound information passed to such director for processing 
therein such information being sent to the remote director from an originating one of the 
directors. Each one of the input/output sections includes a queue for outbound information 
being returned to the source through such originating one of the directors after being 
processed by the microprocessor of such remote one of the directors. 

[0020] In one embodiment, each one of the directors includes a translation table, such 
table storing at a location thereof corresponding to each one of the remote directors a 
producer index for the queue of such remote director and a consumer index for such one of 
the remote directors. 

[0021] The details of one or more embodiments of the invention are set forth in the 
accompanying drawings and the description below. Other features, objects, and advantages 
of the invention will be apparent from the description and drawings, and from the claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0022] These and other features of the invention will become more readily apparent 
from the following detailed description when read together with the accompanying drawings, 
in which: 

[0023] FIG. 1 is a block diagram of a data storage system according to the PRIOR 

ART; 

[0024] FIG. 2 is a block diagram of a data storage system according to the invention; 
[0025] FIG. 2A is a diagram of a packet used in the data storage system of FIG. 2; 



[0026] FIG. 3 is a block diagram of an exemplary one of the directors used in the 
system of FIG. 2; 

[0027] FIG. 4 is a block diagram of a data storage system according to an alternative 
embodiment of the invention; 
5 [0028] FIG. 5 is a block diagram of a data storage system according to an alternative 

embodiment of the invention; 

[0029] FIG. 6 is a block diagram of an exemplary director/memory board adapted for 
use in the system of FIG. 5; 

[0030] FIG. 7 is a block diagram of a data storage system according to another 
10 embodiment of the invention; 

[0031] FIG. 7A is a block diagram of a data storage system according to another 
embodiment of the invention; 

[0032] FIG. 8 is a block diagram of a data storage system according to the invention 
as configured in FIG. 7 having eight director/memory boards; 
15 [0033] FIG. 9A is a diagram of a data storage system according to the invention as 

configured in FIG. 7 having sixteen director/memory boards; 

[0034] FIG. 9B is a diagram of a data storage system according to the invention as 
configured in FIG. 7 having sixteen director/memory boards interconnected differently from 
the interconnections shown in FIGS. 9A, here such interconnections having a minimum 
20 number of nodal transport hops; 

[0035] FIG. 9C is a diagram of a data storage system according to the invention as 
configured in FIG. 7 having thirty-two director/memory boards interconnected according to 
the invention; 

[0036] FIG. 10 is a block diagram of a data storage system according to yet another 
25 embodiment of the invention; 

[0037] FIG. 1 1 is a block diagram of a data storage system according to yet another 
embodiment of the invention, here such system having a shared code storage section; 

[0038] FIG. 12 is a more detailed block diagram of a director used in the system of 

FIG. 2; 

30 [0039] FIG. 13 is a block diagram of a data storage system having a shared code 

storage section according to yet another embodiment of the invention; 



5 



[0040] FIG. 14 is a more detailed block diagram of a pair of the directors shown in 

FIG. 3; 

[0041] FIG. 15 is a flow diagram shows the processing of inbound information (i.e., 
messages) from an originating director of FIG 14, to the remote director of FIG. 14; 
5 [0042] FIG. 16 is a flow diagram shows the processing of outbound information (i.e., 

messages) from the remote director of FIG. 14 to the originating director of FIG. 14; and 

[0043] FIG. 17 is a diagram of a translation table used in the flow diagrams of FIGS. 
15 and 16. 

[0044] The details of one or more embodiments of the invention are set forth in the 

10 accompanying drawings and the description below. Other features, objects, and advantages 
of the invention will be apparent from the description and drawings, and from the claims. 
[0045] Like reference symbols in the various drawings indicate like elements. 

DETAILED DESCRIPTION 
[0046] Referring now to FIG. 2, a data storage system 10 is shown for transferring 

15 data between a host computer/server 12 and a bank of disk drives 14 through a system 

interface 16. The system interface 16 includes: a plurality of, here for example four, front- 
end directors I81-I84 coupled to the host computer/server 12; a plurality of, here for example, 
four, back-end directors 2O1-2O4 coupled to the bank of disk drives 14. The interface 16 
includes a data transfer section 24, having a global cache memory 22. The global cache 

20 memory includes a plurality of, here, n, global cache memory sections 22 a -22 n? as shown. 
The front-end and back-end directors I81-I84, 20i-20 4 and the global cache memory 
communicate with one another through a packet switching network. The packet switching 
network includes crossbar switches 32 coupled to each one the directors, as shown, and a 
packet switching network section 26. 

25 [0047] The front-end and back-end directors I81-I84, 20i-20 4 are functionally similar 

and include a microprocessor (jaP) 29 (i.e., a central processing unit (CPU) and RAM), a 
message engine/CPU controller having a message engine and a memory controller 37; and, a 
data pipe 38, arranged as shown and described in more detail in co-pending patent 
application serial no. 09/540,828 filed March 31, 2000, inventor Yuval Ofek et al., assigned 

30 to the same assignee as the present invention, the entire subject matter thereof being 

incorporated by reference. Suffice it to say here, however, that the front-end and back-end 

directors 1 8 1 - 1 8 4j 20 1 -20 4 control data transfer between the host computer/server 1 2 and the 
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bank of disk drives 14 in response to messages passing between the directors I81-1 84, 20i-20 4 
through the packet switching network. The messages facilitate the data transfer between host 
computer/server 12 and the bank of disk drives 14 with such data passing through the global 
cache memory 22 via the data transfer section 24. More particularly, in the case of the front- 
end directors I81-I84, the data passes between the host computer to the global cache memory 
22 through the data pipe 31 in the front-end directors I81-I84 and the messages pass through 
the message engine/CPU controller 31 in such front-end directors I81-I84. In the case of the 
back-end directors 20i-20 4 the data passes between the back-end directors 20i-20 4 and the 
bank of disk drives 14 and the global cache memory 22 through the data pipe 38 in the back- 
end directors 2O1-2O4 and again the messages pass through the message engine/CPU 
controller 31 in such back-end director 2O1-2O4. 

[0048] It is noted that here the front-end and back-end directors 18i-18 4 , 20i-20 4 are 
coupled to the crossbar switch 32, as shown in more detail in FIG. 3. The crossbar switch 32, 
here a switching fabric, is also coupled to the packet switching network 26 to provide the 
packet switching network for the system interface 16. 

[0049] Each message which is created by the microprocessor 29 under software 
control is stored in a send queue in RAM, not shown. When the message is to be read from 
the send queue in RAM, not shown, and transmitted through the message network 26 (FIG. 
2) to one or more other directors via a DMA operation it is packetized in a packetizer into a 
packet, shown in FIG. 2A. For directors, each one of the packets includes the following 
fields: command; a source ID (i.e., transmitting director); a destination ID (i.e., receiving 
director) address; a payload; and terminates with a 4-byte Cyclic Redundancy Check (CRC), 
as shown in FIG. 2A. For data passing to and from the global memory, each one of the 
packets includes the following fields: command (e.g., read, write); a source ID (i.e., 
transmitting source) and destination ID (i.e., receiving memory location (i.e., the location in 
global memory for the configuration shown in FIG. 2, or the director board ID having the 
global cache memory section and offset, as with the configuration to be described in FIG. 5). 

[0050] The packet is sent to the crossbar switch 32. The destination portion of the 
packet is used to indicate the destination for the message and is decoded by the switch 32 to 
determine the message routing through the packet switching network section 26. The 
decoding process uses a routing table, not shown, in the packet switching network section 26, 
such routing table being initialized by controller during power-up by the initialization and 
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diagnostic processor (controller), not shown. The routing table provides the relationship 
between the destination address portion of the packet, which identifies the routing for the 
message and the one of directors 1 81- 184, 2O1-2O34 to which the message is to be directed. 

[0051] Here, referring also to FIG. 3, the data pipe 38 also includes a packetizer and 
5 depacketizer to transmit and receive the data to and from the cache memory 22 using the 

same transmission, i.e., packet protocol, as that used by the message engine/CPU controller. 
The destination portion of the packet is used to indicate the destination, i.e., address in the 
cache memory 22 for storage of the data during a write or retrieval of data stored in the cache 
memory 22 during a read and is decoded by the switch 32 to determine the message routing 

10 through the packet switching network 26. The decoding process uses a decoder table, not 

shown, in the packet switching network 26, such routing table being initialized by controller 
during power-up by the initialization and diagnostic processor (controller), not shown. The 
table provides the relationship between the destination address portion of the packet, which 
identifies the routing for the data and the location in the cache memory for writing or reading 

15 the data, as the case may be. 

[0052] Referring now to FIG. 4, an alternative embodiment of the system interface 
16, designated as system interface 16', is shown. Here, a redundant packet switching 
network 16' is included. It should be noted that the system may use a single packet switching 
network where such packet switching network includes sufficient redundant connections to 

20 allow full system connectivity in the event of a failure as shown in FIGS. 8, 9A-9C, to be 
described. 

[0053] Referring now to FIG. 5, an alternative embodiment of the system interface 
16, designated as system interface 16", is shown. Here, the system interface 16" includes a 
plurality of, here for example four, front-end director/memory boards 18'i-18 4 coupled to the 

25 host computer/server 12; a plurality of, here for example, four, back-end director/memory 
boards 20 f i-20 f 4 coupled to the bank of disk drives 14. The interface 16" includes a data 
transfer section having a global cache memory, the global memory cache 22 has the global 
cache memory sections thereof, here sections 22 a -22 n , here 22i-22g distributed among the 
front-end director/memory boards 18 f i-18 4 and the back-end director/memory boards 20V 

30 20 f 4, respectively as shown. 

[0054] An exemplary one of the front-end director/memory boards I8VI84 and the 
back-end director/memory boards 20'i-20 ? 4 is shown in FIG. 6. It is noted that because here 



packet switching is being used, the memory section includes a packetizer and depacketizer to 
enable a common packet switching protocol to be used for the data pipe, the message 
engine/CPU controller and the memory section. The data packet passed to and from the 
global memory is shown in FIG. 2A. It is also noted that a memory section on a 
5 director/memory board is able to communicate with the data pipe on the same board via the 
crossbar switch on such board. Thus, as noted above, the crossbar switches are part of the 
overall packet switching network. Thus, the front-end and back-end directors I81-I84, 20i- 
20 4 and the global cache memory sections 22i-22 8 communicate with one another through the 
packet switching network as described above, such network including the crossbar switches 

10 and the packet switching section 26. 

[0055] Referring now to FIG. 7 A, an alternative embodiment of the system interface 
is shown. Here, one of the boards is a cache memory manager as described in the above- 
referenced U. S. patent application Serial No. 10/180,751. 

[0056] Referring to FIG. 7 an alternative embodiment of the system interface is 

15 shown. Here, instead of coupling each one of the crossbar switches to the packet switching 
network section 26 to provide the packet switching network for the system interface, the 
crossbars switches are interconnected as shown in FIGS. 7 and 8. Thus, as shown in FIG. 8, 
each crossbar switch may be considered as a switching node, here designated as nodes A 
through H. Thus, the crossbar switches for nodes A-H provide a packet switching network 

20 100. 

[0057] Thus, as noted above in connection with FIG. 7, each one of the directors is 
coupled to a crossbar switch. The crossbar switch is directly connected to crossbar switch of 
at least two other ones of the directors and indirectly connected to crossbar switching fabric 
of other ones of the directors through the at least two directly connected directors. Thus: 

25 [0058] Crossbar switch A is directly connected to crossbar switches B, E and H and 

is indirectly coupled to the other ones of the crossbars switches (i.e., switches C, D, F, and G) 
though one of directly connected crossbars switches B, E and H. For example, switch D is 
indirectly coupled to switch A through directly connected switch H. 

[0059] Crossbar switch B is directly connected to crossbar switches A, C and F and is 

30 indirectly coupled to the other ones of the crossbars switches (i.e., switches D, E, G, and H) 
though one of directly connected the crossbars switches A, C and F. 
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[0060] In like manner, crossbar switch C is directly connected to crossbar switches B, 
D and G and is indirectly coupled to the other ones of the crossbars switches. Crossbar 
switch D is directly connected to crossbar switches C, E and H. Crossbar switch E is directly 
connected to crossbar switches D, F and A. Crossbar switch F is directly connected to 
crossbar switches E, G and B and is indirectly coupled to the other ones of the crossbar 
switches. Crossbar switch G is directly connected to crossbar switches H, F and C and is 
indirectly coupled to the other ones of the crossbar switches. Crossbar switch H is directly 
connected to crossbar switches A, G and D and is indirectly coupled to the other ones of the 
crossbar switches. 

[0061] Thus, each one of the directors is coupled to a crossbar switch. The switch is 
directly connected to crossbar switch of at least two other ones of the crossbar switches and 
indirectly connected to other ones of the crossbar switches through the at least two directly 
connected crossbar switches. Thus, as noted above, the packet switching network comprises 
the crossbar switches of the directly and indirectly connected crossbar switch. 

[0062] Referring now to FIG. 9 A, a pair of interconnected packet switching networks 
100i, 100 2 is shown to provide a packet switching network for sixteen directors or sixteen 
director/memory boards. Thus, the connection of the nodes of network 100i to the nodes of 
network IOO2 are presented in below: 

Node A of network 100i is connected directly to Node A of network 100 2 ; 

Node B of network 100i is connected directly to Node B of network 100 2 ; 

Node C of network 100i is connected directly to Node C of network 100 2 ; 

Node D of network 100i is connected directly to Node D of network 100 2 ; 

Node E of network 100i is connected directly to Node E of network 100 2 ; 

Node F of network 100i is connected directly to Node F of network 100 2 ; 

Node G of network 100i is connected directly to Node G of network 100 2 ; 

Node H of network 100i is connected directly to Node H of network 100 2 . 

[0063] It should be noted that with such connections, information sent by each one of 
the nodes (i.e., a source node) of one of the networks 100j, 100 2 may be required to pass to 
two nodes of the other network before reaching a destination node of the other network (i.e., 
two hops are required). For example, considering node G of network 100i as the source node 
and node D of network 100 2 as the destination node, information from the source node G of 
network 100i must pass through, for example, node H of network 100i and then through node 
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H of network IOO2 before reaching destination node D of network IOO2. Likewise, again 
considering node G of network 100i as the source node and node B of network IOO2 as the 
destination node, information from the source node G of network 100i must pass through, for 
example, node F of network 100i and then through node F of network IOO2 before reaching 
destination node D of network IOO2. 

[0064] Referring to FIG. 9B an arrangement is shown which eliminates this two-hop 
condition. Here, the connection of the nodes of network 100i to the nodes of network 100 2 
are presented in below: 

Node A of network 100j is connected directly to Node A of network IOO2; 

Node B of network 100i is connected directly to Node F of network IOO2; 

Node C of network 100i is connected directly to Node C of network IOO2; 

Node D of network 100i is connected directly to Node H of network IOO2; 

Node E of network 100i is connected directly to Node E of network IOO2; 

Node F of network 100j is connected directly to Node B of network IOO2; 

Node G of network 100i is connected directly to Node G of network IOO2; 

Node H of network 100j is connected directly to Node D of network IOO2. 

[0065] Thus, it is noted that here and considering node G of network 100i, such node 
is directly connected to node G of network IOO2. It is further noted that two of the nodes H 
and F of network 100i which are directly connected to node G of node 100i are connected to 
nodes of network IOO2 which are indirectly connected to node G of network IOO2. 

[0066] Thus, in general, here each one of the packet switching networks includes: a 
plurality of crossbar switches, each crossbar switch being connected: (1) directly to crossbar 
switches of at least two other ones of crossbar switches in such one of the packet switching 
networks and to one of the crossbar switches of a second one of the plurality of packet 
switching networks; and, (2) indirectly to other ones of the crossbar switches of such one of 
the packet switching networks via the crossbar switch directly connected to such one of the 
packet switching fabric. Two of said at least two other ones of the crossbar switches in such 
one of the packet switching networks are connected indirectly to two crossbar switches of the 
second one of the packet switching networks, such two crossbar switches of the second one 
of the packet switching networks being crossbar switches connected indirectly to said one of 
the crossbar switches of the second one of the plurality of packet switching networks. 
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[0067] Referring now to FIG. 9C, an arrangement is shown for interconnecting 32 
nodes using four of the networks 100 sown in FIG. 8; i.e., networks IOO1-IOO4. The nodes 
are interconnected using the principle set forth above in connection with FIG. 9B. Thus, the 
interconnections are as forth below: 



Network lOOj 


: Network 1UU2 


Network IUU3 


Network IUU4 


A - E 


A - E 


A T" ' 

A-E 


A V 

A-E 


B-F 


B-F 


B-F 


B-F 


C-G 


C-G 


C-G 


C-G 


D-H 


D-H 


D-H 


D-H 


A-B 


A-B 


A-B 


A-B 


B-C 


B-C 


B-C 


B-C 


C-D 


C-D 


C-D 


C-D 


D-E 


D-E 


D-E 


D-E 


E-F 


E-F 


E-F 


E-F 


F-G 


F-G 


F-G 


F-G 


G-H 


G-H 


G-H 


G-H 


H-A 


H-A 


H-A 


H-A 



Interconnects: 

Node A of network 100j is connected directly to Node A of network IOO2; 
Node B of network 100i is connected directly to Node F of network IOO2; 
Node C of network 100i is connected directly to Node C of network IOO2; 
Node D of network 100i is connected directly to Node H of network IOO2; 
Node E of network 100i is connected directly to Node E of network IOO2; 
Node F of network 100i is connected directly to Node B of network IOO2; 
Node G of network 100i is connected directly to Node G of network IOO2; 
Node H of network 100i is connected directly to Node D of network IOO2. 

Node A of network 100i is connected directly to Node A of network IOO3; 
Node B of network 100i is connected directly to Node F of network IOO3; 
Node C of network 100j is connected directly to Node C of network IOO3; 
Node D of network 100i is connected directly to Node H of network IOO3; 
Node E of network 100i is connected directly to Node E of network IOO3; 
Node F of network 100i is connected directly to Node B of network IOO3; 
Node G of network 100i is connected directly to Node G of network IOO3; 
Node H of network 100i is connected directly to Node D of network IOO3. 
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Node A of network 100i is connected directly to Node A of network 100 4 ; 
Node B of network 100i is connected directly to Node F of network 100 4 ; 
Node C of network 100i is connected directly to Node C of network 100 4 ; 
Node D of network 100i is connected directly to Node H of network IOO4; 
Node E of network 100i is connected directly to Node E of network IOO4; 
Node F of network 100i is connected directly to Node B of network IOO4; 
Node G of network 100i is connected directly to Node G of network IOO4; 
Node H of network 100i is connected directly to Node D of network IOO4. 

Node A of network IOO2 is connected directly to Node A of network IOO3; 
Node B of network IOO2 is connected directly to Node F of network IOO3; 
Node C of network IOO2 is connected directly to Node C of network IOO3; 
Node D of network IOO2 is connected directly to Node H of network IOO3; 
Node E of network IOO2 is connected directly to Node E of network IOO3; 
Node F of network IOO2 is connected directly to Node B of network IOO3; 
Node G of network IOO2 is connected directly to Node G of network IOO3; 
Node H of network IOO2 is connected directly to Node D of network IOO3. 

Node A of network 100 2 is connected directly to Node A of network 100 4 ; 
Node B of network IOO2 is connected directly to Node F of network IOO4; 
Node C of network IOO2 is connected directly to Node C of network IOO4; 
Node D of network IOO2 is connected directly to Node H of network 100 4 ; 
Node E of network IOO2 is connected directly to Node E of network IOO4; 
Node F of network IOO2 is connected directly to Node B of network IOO4; 
Node G of network IOO2 is connected directly to Node G of network IOO4; 
Node H of network IOO2 is connected directly to Node D of network IOO4. 

Node A of network IOO3 is connected directly to Node A of network 100 4 ; 
Node B of network IOO3 is connected directly to Node F of network IOO4; 
Node C of network IOO3 is connected directly to Node C of network 100 4 ; 
Node D of network IOO3 is connected directly to Node H of network IOO4; 
Node E of network IOO3 is connected directly to Node E of network 100 4 ; 
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Node F of network 100 3 is connected directly to Node B of network 100 4 ; 

Node G of network IOO3 is connected directly to Node G of network IOO4; 

Node H of network IOO3 is connected directly to Node D of network IOO4. 

[0068] Thus, considering node F of network 100i as a staring node for example, it is 
noted that connections from such node F of network 100i to all other nodes are direct except 
for: nodes indicated by " 1 " which have one hop; and nodes indicated by "2" which have two 
hops. Thus, there are 16 one-hop interconnections and 6 two-hop interconnections. 

[0069] Referring now to FIG. 10, an unbalanced system is shown. That is, here there 
are six front end director/ memory boards and two back end director/memory boards. It 
follows that other arrangements are equivalent, e.g., six back end and two from end, for 
example. 

[0070] Referring now to FIG. 11, another embodiment is shown. Here, there are 
three front-end director/memory boards, four back end director/memory boards, and a shared 
resource board. The stored resource board includes a crossbar switch connected to the back 
and front end director/memory boards as described above in connection with FIG. 8. Here 
the crossbar switch of the code storage section/memory board is coupled to a global cache 
memory section and a shared code storage section. The shared resource board includes a 
shared resource section which includes a microprocessor and message engine/CPU controller 
described above, a memory, here an EPROM having stored therein computer code used by 
the front end and back end directors and an EPROM controller, arranged as shown. The 
shared resource section also includes a maintenance network connection accessible from 
sources and test equipment, not shown, external to the system interface. Such connection 
allows for updates in code which may now be shared by all directors,. The shared resource 
section also enables reset (i.e. boot-up control) and creation of the routing table described 
above. These features are particularly useful as the system interface becomes updated. It 
also enables environmental monitoring of the system. 

[0071] The EPROM stores shared resources (i.e., resources shared by the front end 
and back end directors) and other system resources. Such an arrangement allows for code 
updates through replacement of this shared code storage section/memory board. It should be 
understood that the board may be made without the shared memory section thereon. The 
code stored in the EPROM may include the boot-up code for the CPUs of the front end and 
back end directors, configuration data, emulation data code, diagnostic code. It is noted that 
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the code in the EPROM can be transferred to the message engine/CPU controller and shared 
memory section on the shared code storage section/memory board via the crossbar switch on 
such shared code storage section/memory board. 

[0072] It should be understood that the use of an EPROM is an example of one such 
device, used to hold code images that can be loaded by the various directors. Thus other 
devices include, for example, non-volatile RAM (NVRAM) for status and error logging, 
Time-of-Day clock, Enclosure management, Fabric initialization hardware and software, 
storage system and data storage management utilities, debug access ports and support, and 
global routing information. 

[0073] Referring now to FIG. 12, another embodiment is shown. Here, there are 
three front end director/memory boards, three back end director/memory boards, and a pair 
of shared code storage section/memory boards. The second shared code storage 
section/memory board provides redundancy in the event that one of the pair of storage 
section/memory board fails. 

[0074] Referring now to FIG. 13, the director shown in FIG. 3 is shown in more 
detail. Thus, the data pipe 38 is shown to include an input/output (I/O) interface, a data 
pipe/Queue controller as well as the packetizer and depacketizer. Note that the data 
pipe/queue controller includes an I/O memory and a translation table to be described in more 
detail below. Suffice it to say here that the I/O interface memory does not include a section 
for an inbound queue for reasons to be described below. The queue for inbound messages are 
stored in the CPU memory of the director which is to execute the inbound message and hence 
a "virtual" queue is shown dotted in the I/O interface memory, it being understood that such 
memory does not store inbound messages producer or consumer indices. 

[0075] The microprocessor memory (i.e., the CPU memory) is shown to include in a 
section thereof an inbound queue. It is noted that the microprocessor memory does not 
include a section for an outbound queue for reasons to be described below. The queue for 
outbound messages are stored in the I/O interface memory of the director which is to execute 
the inbound message and hence a "virtual" queue is shown dotted in the microprocessor 
memory, it being understood that such memory is not required to store outbound message's 
producer or consumer indices. 
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[0076] Referring now to FIG 14, a pair of the directors is shown. Note that for 
purposes of describing the operation of the directors one of the pair of directors will be 
referred to as an originating director and the other the remote director. 

[0077] Referring also to FIGS. 15 and 16, FIG. 15 is a flow diagram shows the 

5 processing of inbound information (i.e., messages) from the originating director to the remote 
director and FIG 16 is a flow diagram shows the processing of outbound information (i.e., 
messages) from the remote director to the originating director. 

[0078] Thus, referring to FIG. 15, an I/O (i.e., information) arrives at the I/O interface 
of the originating director, Step 1500. The I/O interface of the originating director creates a 

10 queue entry, Step 1502. The queue controller of the I/O interface of the originating director 
translates the address of the queue entry into an address for a remote director using a 
translation table (FIG. 17) and packetizes the queue information for transmission on the 
packet switching network to the remote director, Step 1504. Also, the I/O interface of the 
originating director updates its producer index. Step 1506. The queue controller of the 

1 5 originating director translates the updated producer index using the translation table (FIG. 

17), packetizes it, and then writes the translated producer index for the remote director's CPU 
memory's inbound queue rather than storing it in the I/O interface memory, Step 1508. Also, 
the I/O interface of the originating director asserts an interrupt request (optional). It is also 
checks that the producer index does not exceed the consumer index (i.e., an overflow 

20 condition). 

[0079] Step 1510. The queue controller of the originating director relays the 
interrupt to the remote director via the packet switching network (optional), Step 1512. 

[0080] The packet switching network transports the writes and interrupt to the remote 
director via the packet switching network, Step 1514. 
25 [0081] The interrupt is recognized by the remote director or the producer index is 

polled if there is no interrupt, Step 1516. 

[0082] The remote director's CPU reads the information formerly written into it's 
CPU memory pointed to by the producer index sent to it by the originating director's queue 
controller. Step 1518. The CPU of the remote director processes the I/O, Step 1520. Also, 
30 the remote director's CPU updates its consumer index, Step 1522. The packet switching 
network transports the updated consumer index to the I/O interface of the originating 
director, Step 1524. The interface queue controller of the originating director receives the 



consumer index from the remote director and translates the index using the translation table 
and stores the translated consumer index, Step 1526. The inbound queuing is completed, 
Step 1528. 

[0083] Thus, referring to FIG. 16, an originating director decides to send an I/O to a 
remote director, Step 1600. The CPU in the originating director creates a queue entry, Step 
1602. Also, the CPU in the originating director updates its producer index for transmission 
via the packet switching network rather than storing it in its own CPU memory, Step 1604. 

[0084] The packet switching network transports writes to the remote director's I/O 
interface, Step 1606. 

[0085] The queue controller of the remote director's I/O interface translates the 
address received from the originating director into an address in the outbound queue using 
it's own producer index, Step 1608. The queue controller of the remote director's I/O 
interface updates its own producer index and stores it where expected by such I/O interface, 
Step 1610. The remote director's I/O interface recognizes the index update, reads the queue 
and updates its consumer index, Step 1612. The I/O interface performs the action required, 
Step 1614. Also, the queue controller of the remote director sees the consumer index update, 
translates it to the consumer index for the remote director using the translation table (FIG. 
17) and transmits it over the packet switching network to the originating director, Step 1616. 
The packet switching network transports writes to the originating director's I/O interface, 
Step 1618. The consumer index is updated in the remote director (used for overflow control), 
Step 1620. Outbound queuing is completed, Step 1622. 

[0086] Referring now to FIG. 17, the translation table is shown. The address for the 
table, f(x), is a function that takes some part of the information in the queue entry and derives 
an address into the table from it. For example, in the queue entry, there may be a transaction 
ID, a host address, a logical device number, or some field that groups queue entries into some 
logical order. In other cases, the function could simply be a counter, taking each entry to the 
next in the translation table. The "Misc. Remote Info" may contain other information needed 
to access the remote node, such as security keys, queue length, task divisioning, table 
checksums and validity notes, preferences for interrupt, and "scoreboard", for example. 

[0087] In addition to this table, two registers are maintained: "Local Producer Index { 
, and "Local Consumer Index:. These indices (or pointers) are the ones that the I/O Interface 
sees and operates with. There can be multiple queues in either direction; therefore, the 
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"Remote Producer Index" entry in the translation table could be actually two indices 
representing two separate queues with this one translation table. If the address offset is the 
same as the Remote CPU Number, then no field in the table is required. In some cases, a 
Remote Producer Index is not required. If the I/O Interface does not supply a Producer Index, 
the Translation Table is a mechanism for synthesizing one. The update of the Remote 
Producer Index can be programmed to produce an interrupt to the Remote CPU even if the 
local I/O Interface does not generate one, 

[0088] A number of embodiments of the invention have been described. 
Nevertheless, it will be understood that various modifications may be made without 
departing from the spirit and scope of the invention. Accordingly, other embodiments are 
within the scope of the following claims. 



WHAT IS CLAIMED IS: 
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